Call data

此次資料採用kaggle網站提供的google play store APP的資料, 先行去除無效或錯誤的資料後進行分析,主要分析應用程式的評分分布。

library(ggplot2)
library(plotly)
## 
## Attaching package: 'plotly'
## The following object is masked from 'package:ggplot2':
## 
##     last_plot
## The following object is masked from 'package:stats':
## 
##     filter
## The following object is masked from 'package:graphics':
## 
##     layout
#data from https://www.kaggle.com/lava18/google-play-store-apps
mydata = read.csv("googleplaystore.csv")

Data Structure

在分析之前,先整理此次所採計的資料。 - 採計的應用程式共9244項,共33種類型,以Family家庭類型最多,其次為Game遊戲、再者為Tools工具類型。

plot_ly(mydata, x = ~Category, color = ~Category, type = "histogram")

Rating

my.plot3 <- ggplot(mydata, aes(x = Rating))
my.plot3 <- my.plot3 +
  geom_histogram(binwidth = 0.1, fill = "steelblue")
my.plot3

plot_ly(mydata, x = ~Rating, color = ~Category, type = "box")